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A search is performed for heavy metastable particles that decay into jet pairs with a macroscopic 
lifetime (cr ~ 1 cm) in pp collisions at y/s = 1.96 TeV using data from the CDF II detector at 
Fermilab corresponding to an integrated luminosity of 3.2 fb~ . To estimate the standard model 
background a data-driven approach is used. Probability-density functions are constructed to model 
secondary vertices from known processes. No statistically significant excess is observed above the 
background. Limits on the production cross section in a hidden valley benchmark phenomenology 
are set for various Higgs boson masses as well as metastable particle masses and lifetimes. 

PACS numbers: 13.85.Ni, 12.60.Fr, 14.80.Da 
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The standard model (SM) of elementary particles fails 
at TeV energies if no new phenomena appear at this scale. 
A single Higgs boson provides the simplest solution, but 
there are other possibilities, some of which predict mas- 
sive metastable particles. They are metastable because 
they can only decay to SM particles through diagrams 
containing a new high-mass force carrier or a loop of very 
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massive particles. These model are broadly categorized 
as "hidden valley" (HV) models We use data from 
pp collisions at \fs = 1.96 TeV collected with the CDF II 
detector at the Fermilab Tevatron collider to search for a 
long-lived massive particle that originates from the pri- 
mary pp interaction, travels a macroscopic distance (of 
order 1 cm), and decays into jet pairs. A variety of pre- 
dicted decay modes are possible for these metastablc par- 
ticles. Although the search is sensitive to any massive 
long-lived particle decaying into jet pairs, for the sake 
of specificity we choose as a benchmark to evaluate the 
results within the context of the HV phenomenology. 

A recent analysis from the DO experiment searched for 
heavy particles decaying with a displaced vertex Q , us- 
ing the same phenomenological model as this analysis. 
However it was restricted to heavy metastable particles 
that decay into b quarks because their trigger required a 
muon in the event. We have no such limitation because 
CDF employs the silicon vertex trigger (SVT) which al- 
lows us to trigger on tracks that originate from displaced 
vertices 0, H| . Thus our search is sensitive to metastable 
particles that decay into any jets, not only &-quark jets. 

We search for an event signature where two jets of par- 
ticles emanate from a point displaced from the primary 
interaction point, i.e., a displaced or secondary vertex. 
Monte Carlo (MC) simulation is used to model events 
from the HV phenomenology. It serves as our benchmark 
for processes containing the signature of the search. Since 
the SM does not contain massive metastable particles, 
we expect little background. We construct a background 
model almost entirely from the data. To accomplish this, 
the kinematics of the data events with secondary vertex 
characteristics are determined from auxiliary SM sam- 
ples. 

After a brief overview of the CDF II detector in Sec.lHl 
the HV phenomenology is described in Sec. IIII1 Sec- 
tion [TV] discusses the event selection for the analysis, and 
Sec.[V]presents the background estimate along with a test 
of the method. The search for the signal is presented in 
Sec. I VII Systematic uncertainties for both the expected 
signal and the background are presented in Scc. lVIII Sec- 
tion IVIIII presents limits on the production cross-section 
in the HV phenomenology. 



II. THE CDF DETECTOR 

CDF is a general-purpose detector that is described 
in detail in Ref. The detector components relevant 
to this analysis are briefly described here. Closest to 
the beampipe are multilayer silicon detectors (SVX) @ 
providing precision tracking which is used to identify dis- 
placed vertices. Outside the multilayer silicon detectors 
is an open-cell drift chamber, the central outer tracker 
(COT), covering the pseudorapidity region |r/| < 1 0. 
(The pseudorapidity -q is defined as — ln[tan(#/2)], where 
9 is the polar angle relative to the proton beam direc- 
tion Q.) The COT is used to reconstruct charged par- 



ticles' momenta. The tracking system is enclosed in a 
superconducting solenoid operating at 1.4 T, which in 
turn is surrounded by a calorimeter. 

The CDF calorimeter system is separated into elec- 
tromagnetic and hadronic components segmented in a 
projective tower geometry covering the region \rj\ < 3.6. 
The electromagnetic calorimeters use a lead-scintillator 
sampling technology [9[, whereas the hadron calorime- 
ters use iron-scintillator technology [Io|, [H| ■ Jets are re- 
constructed from the energy deposited in these calorime- 
ters [l2[. The calorimeter is separated into the central 
(|r?| < 1.0) and forward (or plug) regions (|7y| > 1.0). 

Finally, the muon subdetectors are arrayed outside the 
calorimeters. The beam luminosity is determined with 
gas Cherenkov counters located in the region 3.7 < |?7| < 
4.7 which measure the average number of inelastic pp 
collisions per accelerator bunch crossing 

CDF uses a three-level trigger system, with a mix of 
hardware electronics and dedicated CPUs to select in- 
teresting events. In our analysis, a trigger sensitive to 
Z — > bb decays is used, the ZBB trigger. It employs the 
SVT hardware in the second-level trigger. This trigger is 
described in more detail in Sec. [TV] 



III. THE HIDDEN VALLEY MODEL 
A. Phenomenology 

While the analysis presented here is a search for any 
heavy particle that decays into a pair of jets at a dis- 
placed vertex, it is useful to have a benchmark model. 
The HV phenomenology provides a framework in which 
we can generate signal Monte Carlo samples, search for 
discriminants, optimize our search, and compare results. 
Results presented for this benchmark process can be used 
to constrain other models by accounting for the differ- 
ences in the kinematic properties of the final state. Here 
we present a brief outline of the HV picture. 

In the HV scenario, the standard model gau ge g roup 
Gsm is extended by a non Abelian group G v (ll ll4j|. SM 
particles are neutral under G v . Additionally, G v contains 
new particles, called v-particles, that are charged under 
G v but neutral under Gsm- 

In the particular class of hidden valley models con- 
sidered here, the G v gauge group may become strong 
and confine, analogously to QCD. The w-particles, called 
w-quarks in this class of models, are confined inside v- 
hadrons. Energetic collisions at the Tevatron could cre- 
ate new particles, such as the Higgs boson or a new Z' 
resonance, that could decay to HV particles. If the light- 
est w-hadrons are sufficiently heavy, they can decay to 
SM particles via highly suppressed processes, e.g., mix- 
ing with the longitudinal component oi a Z'. A wide 
range of masses, lifetimes, and final states are possible 
within the HV framework. If the lightest available HV 
particle, a v-n in our benchmark model, has mass less 
than twice that of the top quark, the predominant de- 
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cay would be to bb quark pairs. With a long lifetime, 
some particles would travel a measurable distance from 
the primary vertex before decaying, much like a B or D 
hadron. 

Applying the HV phenomenology to astroparticle 
physics, the authors in Ref. [IH theorize that the ex- 
istence of a dark matter candidate implies another (HV) 
particle with a lifetime such that cr is of the order of 
1 cm. This places the lifetime within the range accessi- 
ble to the CDF II detector. 

The HV also provides a way to search for the Higgs 
boson. If the Higgs boson mixes with a scalar in the 
HV sector that couples to v-quarks, then it may decay 
to two (or more) ii-hadrons. These w-hadrons would in 
turn decay into bb quark pairs. It would be feasible to 
search for the Higgs boson using this final state at CDF. 
Under some HV scenarios, the branching fraction to HV 
particles could be comparable to those of SM decays. In 
addition, searches for these HV decays may have higher 
signal to background ratios due to their unique decay 
topology. A Feynman diagram of this decay is shown in 
Fig.ffl 




FIG. 1: Feynman diagram of Higgs boson production of a 
hidden valley particle and its subsequent decay. The coupling 
of HV particle to bb is extremely small, resulting in the long 
lifetime. 



B. Monte Carlo Simulation Samples 

The Pythia Monte Carlo program version 6.2 [lj| is 
used to generate the events for the signal MC simula- 
tion. Geant3 is used for the detector simulation [l7j . 
To mimic HV production and decay in Pythia, we use 
the minimal supersymmetric standard model process of 
a CP-even Higgs boson (h ) decaying into two CP-odd 
Higgs bosons (a°) which in turn decay into 6-quarks, 
h° —> a°aP — s- bb bb. We alter the mass and lifetime 
of the a in order to simulate the HV particle. This al- 



lows for the generation of signal MC samples using the 
Pythia MC generator without significant modification. 

Two Higgs boson masses arc generated, one at rel- 
atively low mass, 130 GeV/c 2 , and one at higher 
mass, 170 GeV/c 2 . Multiple HV particle masses from 
20 GeV/c 2 to 65 GeV/c 2 arc produced. The cr of the 
HV particle, cthv, is set to 1.0 cm. Some signal MC 
samples have been weighted to study HV particles with 
cthv of 0.3, 2.5, or 5.0 cm. Thus we can study mul- 
tiple HV lifetimes without generating additional signal 
MC samples. 



C. Discriminants from signal MC simulation 

The major characteristic that distinguishes the signal 
from the SM backgrounds is the presence of two jets 
whose momentum vectors both point to a common sec- 
ondary vertex. With this in mind, we developed two 
discriminants shown in Fig. [5] and Figs. G2 ij) and £. In 
both cases, the figures are drawn in the two-dimensional 
plane transverse to the beam line. 

We define a "tagged jet" as a jet with a reconstructed 
secondary vertex using criteria defined in Sec. IIV1 By 
definition, such a jet has both a position (the secondary 
vertex) and a direction defined by the sum of momenta 
of the tracks that make up the vertex, where all these 
quantities are defined in the transverse plane. We define 
tp as the orthogonal vector from the primary vertex, the 
reconstructed location of the pp collision, to the line de- 
fined by the secondary vertex position and momentum 
direction. The magnitude of ip is the distance from the 
primary vertex to the line, i.e., its impact parameter. We 
take the sign of i/j as that of the dot product between ijj 
and the momentum of the tagged jet. The distribution of 
tp for simulated signal events has much larger tails than 
simulated background events. 

The variable £ is defined for events where there are two 
tagged jets; see Fig. [3l The intersection of the two tag 
momenta can be thought of as the reconstructed decay 
vertex of the HV particle. The vector from the primary 
vertex to this reconstructed decay vertex is t ; the magni- 
tude, C, is the reconstructed two-dimensional decay dis- 
tance of the HV particle. 

The sign of £ is determined by taking the dot prod- 
uct between £ and the vector sum of the momenta of the 
two jets with tags. The sign effectively indicates whether 
or not the decay vertex is in the same hemisphere of the 
detector as the jet pair. Signal MC events have more pos- 
itive £ than negative, while the background MC events 
have £ uniformly distributed around zero. 

At this point, it is necessary to discuss the combina- 
torics of the HV event topology. With MC simulation, 
we can use the generator-level information to evaluate if 
the jets with secondary vertices originated from quarks 
whose mother is the HV particle. Using this informa- 
tion, we define four possible topologies in which signal 
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FIG. 2: Schematic diagram of variable ip, the impact parameter of a jet with a secondary vertex. This figure is not to scale. 
The figure is shown in a plane transverse to the beam line. 



MC events can be classified. 

1. Two-tag HV: both jets originate from the same 
HV particle. 

2. One tag each: each jet originates from a different 
HV particle. 

3. One HV jet: one jet originates from a HV particle; 
the other does not. 

4. No HV jets: neither jet originates from a HV par- 
ticle. 

Figure |4] shows the (a) ip and (b) ( distributions for dif- 
ferent signal MC simulation topologies and background 
MC simulation (Pythia QCD bb). The signal MC as- 
sumes M H = 130 GeV/c 2 , M RV = 40 GeV/c 2 , and 
ct hv = 1-0 cm. These distributions have been normal- 
ized to unit area and show the discriminating power of 
both variables. 



For the two-tag HV topology, the distribution of £ is 
nearly always positive, which improves the discrimina- 
tion against the background. The one-tag-each topology 
is a distribution that is both positive and negative, but 
mostly negative. Between the two-tag HV and the one- 
tag-each topologies, we concern ourselves with the first 
because it has more discriminating power, and if a signal 
is seen, the HV particle's invariant mass can be recon- 
structed. The final two topologies are very rare, but are 
included in the figures for completeness. 



In addition to these two variables, the separation of 
the two jets {AR = ^{A<p) 2 + (A?/) 2 ) is a useful dis- 
criminant (see Fig. Hfc). In the two-tag HV topology, the 
decay daughters of the HV particle are more collinear 
than the one-tag-each topology, but still different than 
the QCD background, which is dominated by gluon split- 
ting at low AR, once a AR < 2.5 cut removes most of 
the direct bb production. 
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FIG. 3: Schematic diagram of variable (, which represents the decay vertex of the HV particle. 



IV. EVENT SELECTION 
A. ZBB trigger 

Z —> bb events are collected at CDF for the pur- 
pose of studying the jet energy scale (JES) of 6-quark 
jets [IH This is achieved by means of a specially 

designed trigger, the ZBB trigger (Table [TJ which selects 
events containing tracks with a large impact parameter 
with respect to the primary vertex (do). Because a HV 
particle would decay at a displaced vertex, tracks from 
this decay would have large do- Thus we use this trig- 
ger for our signal search. The total integrated luminosity 
collected with the ZBB trigger is 3.2 fb _1 . 

At leveLl, the trigger has two requirements. It selects 
events with at least one central calorimeter tower with 
Et > 5 GeV and at least two tracks, one with px > 
5.48 GeV/c, the other one with p T > 2.46 GeV/c. 

At level_2, there are two different paths, called op- 
posite side (OS) and same side (SS), which refer to 
the topological configuration of the displaced tracks in 



the event. Both paths contain a veto on jets in the 
plug calorimeter and a central calorimeter requirement. 
The plug jet veto requires that there are no calorime- 
ter clusters with Et > 5 GeV in \rj\ > 1.1 and is 
designed to reduce the QCD background, which pro- 
duces more gluon radiation than does the signal. The 
trigger requires at least two central calorimeter clusters, 
Et > 5 GeV and |r/| < 1.1, which are on opposite sides 
of the calorimeter. Finally, the calorimeter clusters must 
have in total at least two displaced SVT tracks, with 
track pt > 2 GeV/c, 160 /im < |d | < 1000 (j,m, sec- 
ondary vertex fit x 2 < 12. For the OS path, the two 
tracks must have 150° < Acf> < 180°. The SS path re- 
quires that the two displaced tracks point to a single 
cluster, and that 2° < A(f>< 30°. 

At level_3, the trigger requires at least two jets with 
Et > 10 GeV and |?7| < 1.1. Jet clustering uses a cone 
algorithm of size AR = 0.7. The trigger also requires 
at least two tracks with 160 /jm < \do\ < 1000 fim, to 
confirm the level_2 requirements. As a cross-check this 
requirement is performed with both SVT tracks and COT 
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TABLE I: ZBB trigger requirements. One of the two level_2 paths, opposite side or same side must be satisfied. 



LeveLl at least one central calorimeter tower with Et > 5 GeV; 

at least two tracks: one track with pr > 5.48 GeV/c, one with pr > 2.46 GeV/c 
LeveL2 veto events with a calorimeter cluster with Et > 5 GeV, 1.1 < \rj\ < 3.6; 

require at least two clusters Et > 5 GeV, \r)\ < 1.1, which have 135 < A<j) < 180; 

at least two SVT tracks with p T > 2 GeV/c, 160 /im < |do| < 1000 fim, \ 2 < 12 
(OS) tracks have 150° < A(f> < 180° 
(SS) tracks have 2° < A0 < 30° 

Level_3 at least two AR = 0.7 jets with E T > 10 GeV, \r)\ < 1.1; 

at least two SVT tracks with p T > 2 GeV/c, \rj\ < 1.2, 160 (jm < \d Q \ < 1000 /xm; 
at least two COT tracks with p T > 1.5 GeV/c \rj\ < 1.2, 130 ^m < \d \ < 1000 /jm, 
track impact parameter significance S(do) > 3, \Az\ < 5 cm 



tracks, with additional impact parameter significance and 
\Az\ requirements imposed on the COT tracks. The track 
parameter zq is the distance in the z direction from the 
detector origin to the point on the z axis closest to the 
track helix. The quantity \Az\ is the magnitude of the 
difference in zq between the two COT tracks. The cut on 
| Az\ ensures both tracks originate from the same primary 
vertex. The same level_3 requirements are imposed on 
both OS and SS level_2 paths. 

After the trigger selection, there is a further jet classi- 
fication. Jets in the analysis stage are reconstructed with 
a AR = 0.4 cone. The larger radius jet cone used in the 
trigger provides high trigger efficiency. In the analysis, a 
cone of 0.4 is used to avoid unnecessary merging of jets. 
The Et of the jet, after being corrected to the hadron 
scale (E T ° r ) must be greater than 20 GeV. The jets must 
be in the central region of the detector, |r/| < 1.0. This 
requirement overlaps with the ZBB trigger requirement. 
These jets are referred to as "tight-central" jets. How- 
ever, as explained below, there are instances where non- 
tight-central jets are used in this analysis. 



B. Signal and control regions 

While the HV phenomenology predicts four jets in the 
final state, we allow events with three jets in order to 
increase our acceptance. In addition, the plug jet veto in 
the ZBB trigger at levcl_2 reduces jet multiplicity (while 
simultaneously lowering the trigger rate at high luminos- 
ity). Thus, in order to maintain acceptance, the signal 
region is defined with three or more tight-central jets. 

The signal MC samples show that the opening angle 
between the two jets is not back-to-back, but instead 
usually smaller. The lighter the HV particle, the smaller 
the opening angle between the jets. Thus, for each jet 
pair in a 3-or-more jet event, we apply a cut of AR < 2.5 
on each pair. Events which pass the jet multiplicity cut 
and have a jet pair passing the AR cut are said be in the 
signal region. 

In addition to the signal region, we define a two-jet con- 
trol region to validate our background estimation tech- 



nique on a set of events that is devoid of signal. This 
control region is defined as follows: events are required 
to have exactly two tight-central jets with no AR re- 
quirement, and if additional jets are present, they must 
have uncorrected Et < 15 GeV. The control and signal 
regions are mutually exclusive. 



C. Secondary vertex tagging 

Secondary vertex tagging is used in this analysis to 
identify jets with displaced vertices. We modified the 
standard CDF algorithm SECVTX to increase the 
efficiency for very long-lived particles, such as the HV 
particle, by extending the maximum impact parameter 
allowed for tracks used in vertexing. Twenty | dolman cuts 
between 0.15 cm and 1.6 cm are studied to maximize the 
signal while minimizing the increase in mistags. (Mistags 
are light-flavor quark or gluon jets erroneously tagged as 
having a displaced vertex.) 



V. BACKGROUND ESTIMATION 

We want to produce a background estimate that re- 
tains the kinematic correlations in QCD multijet events 
but has secondary vertices modeled on those observed 
in SM processes. Toward this end, we use the jet kine- 
matics of our primary data sample, and to each event, 
we add secondary vertices whose properties come from 
other data samples. The vertices in these samples are 
characterized by probability-density functions (PDFs) as 
a function of jet energy, flavor, and the number of dis- 
placed tracks available to the ZBB trigger. 

The ability to find particles with displaced vertices re- 
lies on the reconstruction of secondary vertices. These 
secondary vertices can come from multiple SM sources, 
which are listed in Table [TT1 
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FIG. 4: Distributions of (a) ip, (b) f, and (c) AT? for dijets 
where both jets are tagged. The signal MC simulation his- 
tograms are stacked. In (a), only the leading jet of the jet pair 
is shown. In (c), a AR < 2.5 cut has already been applied. 



A. Building PDFs 

The first step in modeling the background is to build 
the standard model secondary vertex PDFs in tagged 
jets. The PDFs are constructed from data events, when 



TABLE II: Standard model processes that can result in jets 
with reconstructed displaced vertices. 



Background SM Production 

6-quarks QCD bb, it, W/Z+jets, WZ/ZZ 

c-quarks QCD cc, W/Z+jet, WZ/ZZ 

light-flavor (mistags) QCD qq & gg, hadronic ts 



possible, where the signal is not expected to be present, 
and in effect encapsulate SM secondary vertex informa- 
tion. 

The pdf variables arc defined in the plane transverse 
to the beam line. We define the variables for the sec- 
ondary vertex with respect to the parent jet momentum 
vector, also called the jet axis. First, define L xy as the 
two-dimensional vector from the primary vertex to the 
secondary vertex. There are two components to this L xy 
vector, one parallel to the jet axis, one perpendicular. 
These two components are the first two pdf variables and 
are named u and v. These two variables define the posi- 
tion of a secondary vertex with respect to a jet axis. 

The two-dimensional A<j) angle between the jet axis 
and the secondary vertex momentum vector is the third 
PDF variable, named a. This variable defines the direc- 
tion of a secondary vertex with respect to a jet axis in 
the plane transverse to the beam. 

We find that correlations exist among all three vari- 
ables. To preserve these correlations, we store these 
PDFs in three-dimensional histograms. 

We split these PDFs into three main categories for dif- 
ferent quark flavors: 6-quark, c-quark and light-flavor 
jets. These are further split into different bins of Ej? 1 
and the number of SVT tracks satisfying the ZBB-trigger 
SVT requirement. We separate jets into bins of zero, one, 
and two or more SVT tracks, as the ZBB trigger requires 
two displaced tracks in the event. We choose this bin- 
ning because the PDFs' shapes are different in each bin, 
due to the dependence of secondary vertex production on 
these quantities. 

Different data sources are used to construct the dif- 
ferent quark flavor PDFs. We use a muon trigger with a 
relatively low px requirement to build the 6-quark PDFs. 
These data are rich in B hadrons which decay semilepton- 
ically. To select events, we use a two-jet selection where 
one jet is required to have a muon present within its jet 
cone, and both jets are tagged while being well separated 
in the detector (A</> > 2.0). The nonmuon jet, called the 
away-jet, is the jet used to construct the &-quark PDFs. 

For the light-flavor PDFs, we use the various CDF 
QCD jet triggers which collect a large number of QCD 
multijet data events. These data must have their heavy 
flavor contribution subtracted in order to isolate the 
light-flavor events with secondary vertices; in effect, these 
are mistag PDFs. The flavor composition of the jet trig- 
gers is calculated using QCD MC templates of &-quark, 
c-quark, and light-flavor secondary vertices. (In general, 
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we refer to the secondary vertex modeling derived from 
data as PDFs, while reserving the word "template" to de- 
scribe information obtained from MC samples.) Pythia 
QCD MC samples were generated with multiple momen- 
tum thresholds. The two samples we used in this analysis 
are QCD to 66 and generalized QCD with no final-state 
filtering. The former is mostly used when comparing 
the signal MC in order to determine discriminants, see 
Sec. IIII1 The latter is used to construct the MC templates 
for the purposes of estimating the background. 

The vertex mass squared is the square of the sum of 
the four-momenta of the tracks that form the secondary 
vertex, where the mass of the track four-momentum is 
set to the mass of the pion. We determine the flavor 
composition of the jet trigger data by fitting the QCD 
MC templates of vertex mass to the data. Heavy flavor 
shapes (b- and c-quark contributions) are subtracted from 
the jet trigger data, using the QCD MC as the source of 
the shapes of the b- and c-quark distributions for u, v, 
and a. 

In order to ensure that the b- and c-quark shapes for 
subtraction accurately represent the data, we compare 
the 6-quark PDFs from the muon trigger to the QCD 
MC simulation where the jets are matched to a B hadron. 
The ratio of the means of each PDF variable is calculated 
as the scale factor. The shapes of the distributions of 
these variables are the same after applying these scale 
factors to the PDF variables for 6-quark jets from the 
QCD MC simulation. These scale factors (SF U ~ 0.99, 
SF V ~ 1.39, SF a ~ 1.04) then are applied to the PDF 
variables for both 6-quark and c-quark jets from the QCD 
MC simulation when generating the templates used in the 
subtraction procedure. 

Finally, c-quark jets are not readily identifiable in real 
data. Therefore we use QCD MC simulation in order to 
collect jets for the c-quark PDFs. This is not a serious 
limitation because we find that the ZBB data has a very 
small charm-quark component. 

Two-dimensional projections of a 6-quark PDF for jets 
with E T from 30 to 70 GeV and one SVT track within 
the jet are shown in Fig. [5j 
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B. Pseudoevent generation 

We build a background estimate using pseudoevents 
which are produced by applying the secondary vertex 
PDFs to jets in events from the ZBB trigger data. While 
the kinematic information comes from the real event, the 
PDFs arc used to characterize secondary vertices from 
SM sources in the jets. 

The background estimate is done using the same ZBB 
trigger sample used to search for the signal. Events in the 
signal and control regions are separately selected. When 
generating pseudoevents, a pair of tight-central jets is 
used. For the signal region, this is the dijet system where 
AR < 2.5. (In principle, there could be multiple dijets 
in a three-or-more-jet event, but in practice, we find only 



FIG. 5: Two-dimensional projections of a 6-quark PDF for 
jets with E T from 30 to 70 GeV and one SVT track within 
the jet: (a) it vs v, (b) a vs it, and (c) a vs v. 



one dijet system in each event.) In the control region, 
the dijet system is simply the two tight-central jets in 
the event. 

Before the pseudoevents are generated, we must under- 
stand the ZBB trigger data. First, we obtain the dijet tag 
probability of real dijets in the ZBB trigger data. This 
is the probability that both of the jets are tagged. The 
purpose of this dijet probability is to preserve kinematic 
correlations that may exist with respect to tagging. The 
dijet probability is calculated in terms of both the Et of 
the jets and the number of SVT tracks, as the probability 
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of a tag changes with these variables. 

Second, we obtain the flavor composition of the dijets 
in the ZBB trigger data. However, unlike the flavor com- 
position of the JET trigger samples, where we were con- 
cerned with single jets, here we are concerned with the 
flavor composition of pairs of jets. With three possible 
flavor categories: 6-quark (B), c-quark (C), and light- 
flavor (L), and two jets, there are nine possible combi- 
nations of double flavors for a pseudodijet: BB, BC, BL, 
CB, CC, CL, LB, LC, and LL; where mixed states such 
as the BC and CB states are not considered degenerate. 
The first letter describes the flavor for the leading Et jet 
and the second letter that of the subleading Et jet. 

We use two-dimensional fits of the vertex mass of the 
two vertices to determine the flavor of jet pairs with 
tags. We reuse the Pythia QCD dijet MC templates 
of the vertex mass. First, the individual &-quark, c- 
quark, and light-flavor MC templates arc joined to form 
two-dimensional vertex mass templates for BB, BC, BL, 
etc. Then, the two-dimensional vertex mass templates 
are merged to form a single vertex mass template that 
encompasses all nine double-flavor states. Because the 
nine double flavor fractions must add to one, there are 
eight fractions that we fit, which are algebraic combina- 
tions of the nine double-flavor states. Fits are performed 
using the RooFit package [2l|; an example is shown in 
Fig. H3 and the fit result is shown in Table IIIII 



TABLE III: Double-flavor fraction fit results for dijets where 
each jet has exactly one SVT trigger track. 
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The pscudocvent generation process is as follows: with 
a jet pair selected, from either the control region or signal 
region, we proceed to generate tags for the jets in the 
pair. The probability of both jets having tags, calculated 
from the ZBB data, is used to assign whether or not both 
jets have a tag in the pseudoevent. 

Next, the flavor of the pseudojet is generated. Using 
Table IIIII as an example, if both jets have exactly one 
SVT trigger track, there is a 91.28% probability that the 
jet pair is BB, or two 6-quark jets, a 4.78% chance that 
the pair is BL, where the leading Et jet is a 6-quark jet 
and the subleading Et jet is light-flavor, etc. 

The secondary vertex information is sampled from the 
PDFs, generated using background processes. In this 
step the jets are sampled independently. The sampling 



FIG. 6: Example of double-flavor fits for dijets where each jet 
has exactly one SVT trigger track. Histograms are projections 
of the two-dimensional fit onto the axis of each jet: (a) the 
higher Et jet in the event, (b) the lower Et jet. 

is performed on the three-dimensional histogram where 
the PDF information is stored. Random u, v, and a are 
chosen according to the PDF's distribution and assigned 
to the pseudojet. 

To complete the process, a pseudoevent is generated 
for each event in the ZBB trigger data (which are part of 
the control or signal regions), thereby creating "pseudo- 
data" with the proportion of secondary vertices and the 
flavor composition derived from the ZBB trigger data, 
and the secondary vertex information corresponding to 
SM sources via the PDFs. 



C. Validation 

We use the control region to validate this algorithm. 
Because we expect the control region to be devoid of 
signal, we can compare the real dijet data to the pseu- 
dodijets generated to see if the pseudoevent generation 
replicates the data. For the purposes of this validation, 
exactly one pseudoevent is generated for each real event, 
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and the PDFs are only sampled once for each pseudojet. 
Distributions of the control region pseudoevent vs real 
events show that the pseudoevent generation is well be- 
haved. Figure [7] shows the distribution of £ in the control 
region for real events and pseudoevents, along with the 
ratio of the two distributions. 



number of events that pass these cuts averaged over all 
pseudoexperiments, and represents the number of events 
in ZBB trigger data that would pass the analysis cuts 
if only SM processes contributed to the observed data. 
We perform a simple counting experiment by comparing 
this background estimate to the number of observed data 
events with the same analysis cuts. 
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FIG. 7: The distribution of ( for the real events (solid line) 
and pseudoevents (triangle points) in the control region (a), 
and their ratio (b). 



VI. SIGNAL SEARCH 
A. From pseudoevents to a background estimate 

To generate the background estimate, we create pseu- 
dodata as described above for events in the signal region. 
We construct a "pseudoexperiment" by sampling from 
the PDFs for each dijet in the ZBB trigger sample. We 
carry out this procedure 10 000 times to create an ensem- 
ble of pseudoexperiments. Each is treated independently 
and is passed through the same set of analysis cuts, which 
will be described in further detail. The resulting num- 
ber of events that pass these cuts is calculated for each 
pseudoexperiment. The background estimate is the mean 



B. Signal to background optimization 

To conduct our search, we investigate the following 
variables, optimizing the last three. 

1- |^o|max cut on tracks that are used by the tagging 
algorithm. 

2. Separation of the two jets (AJJ). 

3. %l>, the impact parameter of a tagged jet. 

4. £, the decay distance of the HV particle. 

The |e?o|ma£c cut is a parameter of the tagging algo- 
rithm. Figure E] shows the behavior of the signal MC 
simulation and background pseudodata, where one pseu- 
doevent is generated for each real event, respectively, for 
the 20 Idolmaz cuts investigated. The standard CDF 
Moliriaa- cut (|rfo|max < 0.15 cm) reduces the efficiency 
of finding secondary vertices from the signal MC events 
by more than half. 
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FIG. 8: Number of signal MC simulation events (triangle 
points) and background pseudoevents (square points) vs the 
6-tagging algorithm cut | dolman- 

The distribution for the signal shows that larger 
I | max cuts allow for more signal acceptance, while the 
background plateaus at about | do | max = 0.70 cm. We 
choose the maximum | do | ma x cut consistent with the 
physical constraints of the CDF detector. The inner de- 
tector contains a beampipe with radius r = 1.26 cm. At- 
tached to the beampipe is a single layer of silicon strips, 
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LaycrOO (LOO). While it is not required that tracks de- 
posit hits in this inner most layer, we want to ensure that 
tracks originating from the primary vertex could hit this 
detector. Thus, the do of tracks originating from a pri- 
mary vertex must be less than the radius of the beam 
pipe. However, the beam line at CDF is not at the exact 
center of the detector. Accounting for this shift, we de- 
termine that tracks with a maximum \oIq\ < 1.0 cm may 
deposit hits in LOO, thus a |<io I maz < 1-0 cm is chosen. 

With the | do | max cut set, we optimize the cuts on 
the other three variables by maximizing S/\fB. The 
cuts chosen for the low- and high-HV mass searches 
(20 GeV/c 2 for the former; 40 and 65 GeV/c 2 for the lat- 
ter) are shown in Table IIVI The searches are optimized 
separately because the low-mass HV results in daugh- 
ter jets that are more collinear. This changes the na- 
ture of the AR cut. For the low- HV- mass search, only a 
AR max < 0.75 cut is imposed; no AR m i n cut is applied. 



TABLE IV: Variable cuts for both the low- and high-HV mass 
searches. 



Variable 


hi 


gh HV 


low HV 






mass 


mass 


\do\max (cm) 


< 


1.0 


< 1.0 




> 


0.75 


n.a. 


^.Rrnax 


< 


2.0 


< 0.75 


\ip\ (both jets) (cm) 


> 


0.11 


> 0.12 


C (cm) 


> 


0.8 


> 0.7 


|C| (cm) 


< Minimum(L a: j / 


1; L xy 2) 



An additional cut shown in Table IIVI is imposed on 
£. The magnitude of £ must be less than the distance 
from the primary vertex to the closest secondary vertex. 
This ensures that the decay point is between the primary 
vertex and both secondary vertices. 

An unanticipated source of background became appar- 
ent when we applied these analysis cuts to the real ZBB 
trigger events. A few events in the low-HV mass sam- 
ple appear to contain a single secondary vertex from a B 
hadron in which some of the decay products are found in 
each of two nearby jet cones. This is a consequence of the 
small cone size used in the jet algorithm. Since this is a 
physical background that we had not thought of, we went 
to the signal MC to find criteria that would remove this 
source of background but not adversely affect the signal 
sensitivity. Two features of this background are that 1) 
the two secondary vertices have very small separation in 
the transverse plane (A^d), and 2) the total invariant 
mass of all the tracks in both vertices (T,m v t x ) is less 
than the 6-quark mass. Table IVl shows the cuts made in 
these variables. When these cuts are added to the low 
HV mass search, the excess background described above 
is removed, while the reduction in the efficiency in the 
signal MC simulation is negligible. 



TABLE V: Additional requirements on the low-HV-mass 
search due to an unanticipated background. 



Variable low HV mass 

AS 2d (cm) > O06 
OR 

T,m vtx (GeV/c 2 ) > 5.0 



C. Results 

With the variable cuts set, we run 10 000 pseudoex- 
periments for both mass searches to estimate the SM 
background. The distributions of the number of pseudo- 
events passing the analysis cuts arc Poisson distributions 
with means /j,i ow = 0.58 and [ihigh = 0.29. The statis- 
tical uncertainties on these numbers are negligible. The 
systematic uncertainty due to the background estimate 
procedure is described in the next section. 

With the same variable cuts, we can also calculate the 
number of expected signal MC events that we would ob- 
tain with the same integrated luminosity as the ZBB trig- 
ger. This is done by calculating the number of events 
that pass the cuts in each signal MC sample and mul- 
tiplying this number by a scale factor consisting of the 
luminosity of the ZBB trigger sample multiplied by the 
cross section for Higgs boson production (gg — > H) di- 
vided by the number of signal MC events generated. The 
Higgs boson cross sections arc obtained from Ref. [Hj]. 
For Mh = 130 GeV/c 2 , the cross section is cr gg ^H = 
858 fb, while for M H = 170 GeV/c 2 , the cross section 
is crgg^H = 349 fb. The branching ratio of the Higgs 
boson is assumed to be 100% to the HV particles, and 
the branching ratio of the HV particles is assumed to be 
100% to bb pairs. 

When calculating the expected number of signal MC, 
two rewcightings are performed in order to account for 
differences between the ZBB trigger data and signal MC 
events. First, we reweight to account for differences in 
the luminosity profile of the signal MC events vs data 
events. The distribution of the number of primary ver- 
tices in data is divided by the same distribution in the 
MC events. The ratio is used as an event- by- event weight, 
ranging from 0.75 to 5.0 depending on the number of pri- 
mary vertices, in order to match the luminosity profile of 
the signal MC simulation to the ZBB trigger data. 

A second reweighting is performed to account for dif- 
ferent trigger efficiencies for different run ranges in order 
to match the data's trigger efficiency to that of the signal 
MC simulation. 

To account for the differences between the MC and 
data tagging efficiencies, a tagging scale factor for the 
tagging algorithm with Idolmaz < 1-0 cm is also applied 
twice (SF ta ggi n g = 0.9 x 0.9 = 0.81), because we have 
two tagged jets [f|. 

Finally, a scale factor (SF tr igger — 1.12 ± 0.11) is ap- 
plied to account for systematic effects present in the ZBB 
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trigger simulation used on the signal MC events [l8j . 

Table I VII shows the results of our search. The number 
of expected signal events is calculated from each of the 
signal MC samples. The number of background events 
is also shown. Both uncertainties are calculated in the 
next section. Of the 6.2 x 10 6 data events in the signal 
region, 124 000 of which have two tagged jets, one event 
passes the analysis cuts in the low-HV mass search, and 
one different event passes these cuts in the high-HV mass 
search. 



VII. SYSTEMATIC UNCERTAINTIES 

The systematic uncertainties in this analysis fall into 
two main categories. The first are systematic effects that 
affect the background estimate. The second are the sys- 
tematic effects that affect the number of expected signal 
MC events. 

In the first category, there are three major sources of 
uncertainty, each corresponding to a step in the pseudo- 
event generation. First, the tagging probability is shifted 
by its statistical uncertainty one sigma in each direction. 
The results are propagated through as a systematic un- 
certainty. 

The flavor composition probabilities used to determine 
the pseudoflavor of the jets result in two more system- 
atic uncertainties: the statistical uncertainty from the 
flavor composition fractional fit and a systematic due to 
the MC simulation over-efficiency in track reconstruction, 
which has a direct impact on the vertex mass of the sec- 
ondary vertex. When additional tracks in the MC are 
reconstructed, they will add to the vertex mass of the sec- 
ondary vertex. For the former, we use one sigma shifts in 
both directions. For the latter, we use an overall 3% re- 
duction in the vertex mass to model a maximal variation 
this over-efficiency would produce [23j]. This reduction 
changes the flavor composition, and propagates through 
as a systematic uncertainty. 

We generate five new ensembles of pseudoevents where 
each is generated with one of the systematic shifts de- 
scribed above. For each, we perform another 10 000 pseu- 
doexperiments as before and compare the background es- 
timate to the central value calculated in Table IVII The 
percent difference is taken as the systematic uncertainty. 

For the last of the three steps in pseudoevent genera- 
tion, the PDF sampling, we turn to the bootstrap tech- 
nique [Hj]. The bootstrap technique measures the sys- 
tematic uncertainty arising from shape uncertainties in 
the PDFs. Effectively the three-dimensional histograms 
are statistically varied within their Poisson statistical 
fluctuations, and a new background estimate is calculated 
using 10 000 new pscudocxpcrimcnts. This procedure is 
itself replicated 200 times. The standard deviation of 
these 200 means is the uncertainty associated with the 
PDF sampling. 

The second category of systematic uncertainties affects 
the signal estimate. These include uncertainties associ- 



ated with: 

1. jet energy scale, 

2. trigger simulation, 

3. tagging scale factor, 

4. parton distribution function, 

5. luminosity. 

The first systematic uncertainty is calculated separately 
for each signal MC sample, while the systematic uncer- 
tainties on the trigger simulation, tagging scale factor, 
and luminosity have the same value across all samples, 
and the parton distribution function has approximately 
the same value for all samples. 

The JES factor on jets is varied up (down) one sigma 
with respect to its central value. The result is that more 
(less) jets pass the i?™ r > 20 GeV cut. This affects the 
number of expected signal MC events differently for each 
sample. 

The ZBB trigger simulation scale factor has an un- 
certainty of 8.9%. The scale factor systematic uncer- 
tainty for the tagging algorithm at the operating point, 
Mo|maa; < 1-0 cm, is 10%. The parton distribution func- 
tion uncertainty is taken from Ref. [22j which documents 
this uncertainty for multiple analyses, including ones that 
use gg — > H production (2.5%). Finally, the luminosity 
uncertainty is 6% j25( ■ 

All the systematic uncertainties calculated are shown 
in Table IVII1 These systematic uncertainties are used in 
the calculation of the limits discussed in Sec. IVIIII 



VIII. FINAL RESULTS AND CONCLUSION 

With all the uncertainties calculated, we form test 
hypotheses consisting of our background estimate along 
with our signal MC. A separate test hypothesis is con- 
structed for each set of masses and lifetimes. We also 
create corresponding null hypotheses consisting only of 
the background estimate for each HV mass search. Ta- 
ble IVIIII shows p- values for each set of masses, showing 
the probability that the null hypothesis has fluctuated to 
the data. 

We do not observe a statistically significant excess, 
thus we proceed to set a limit on the production cross 
section times branching ratio of the HV model for the 
particular masses and lifetimes we studied. A Bayesian 
limit calculator is used for this calculation [26|, [27| . Ta- 
ble IIXI shows the resulting observed limit and median 
expected limit, along with the ±1 and ±2 sigma values 
on the expected limit, all at 95% confidence level (C.L.). 

The counting experiment is performed with a small dis- 
crete number of events, where the background estimate 
is less than one. Thus, the expected number of events 
can only fluctuate up (from zero). The result is that the 
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TABLE VI: Results of the search. Two events remain after all cuts: one event in the low-HV-mass search, the second (different 
event) in the high-HV-mass search. Uncertainties are discussed in Sec. IVIII 



Higgs Boson HV Mass HV life- Expected 


Background 


Number 


Mass (GeV/c 2 ) (GeV/c 2 ) time (cm) Signal MC 


Estimate 


Observed 


low HV mass search 






130 20 1.0 0.64 


0.58 


1 


170 20 1.0 0.074 


0.58 


1 



high HV mass search 

130 40 1.0 0.26 0.29 1 

170 40 1.0 0.38 0.29 1 

170 65 1.0 0.14 0.29 1 

130 40 0.3 0.24 0.29 1 

130 40 2.5 0.10 0.29 1 

130 40 5.0 0.043 0.29 1 



TABLE VII: Summary of systematic uncertainties for the 
background estimate and signal MC simulation. The JES 
is calculated separately for each signal MC sample. 



Uncertainty 


Down (%) Up (%) 


Background estimate - 


low HV mass search 


Data statistics 


±0.039 


Tag prob. statistics 


-7.7 3.4 


Flavor composition 


-0.5 2.75 


Background estimate - 


high HV mass search 


Data statistics 


±0.046 


Tag probability statistics 


±3.9 


Flavor composition 


-0.5 8.9 


Signal MC 


Jet Energy Scale 


-15.6 to -6.3 4.0 to 25.5 


Trigger Uncertainty 


±8.9 


Tagging scale factor 


±10 


PDF 


±2.5 


Luminosity 


±6 



negative sigma expected limits are identical to the me- 
dian limit. Also, the ±1 sigma expectation is 1 event, 
which is what we see. Thus, our limit is the same as the 
±1 sigma expectation. 

Figures [5J [TU1 [11] and [T^] show the results of the limit 
calculation. In Figs. |9l [10]and[TT] the x-axis is the mass 
of the Higgs boson. Figure His for a M HV of 20 GeV/c 2 
corresponding to the low-HV-mass search. Figure [TU] is 
for a Mhv of 40 GeV/c 2 , corresponding to the high-HV- 
mass search. Figure QT] shows the results of the high-HV- 
mass search for a Mhv of 65 GeV/c 2 . Figure fT2l shows 
the limits for M H of 130 GeV/c 2 and M H v of 40 GcV/c 2 
with the HV lifetime on the x-axis. 

In conclusion, we have searched for heavy metastable 
particles that decay into a jet pair at a displaced vertex 
at CDF. No statistically significant excess is observed, 



TABLE VIII: Null hypothesis p-values for this search. 



Higgs Boson HV Mass HV life- p-value 

Mass (GeV/c 2 ) (GeV/c 2 ) time (cm) 



low HV mass search 
130 20 1.0 0.44 

170 20 1.0 0.43 



high HV mass search 

130 40 1.0 0.27 

170 40 1.0 0.26 

170 65 1.0 0.26 

130 40 0.3 0.27 

130 40 2.5 0.27 

130 40 5.0 0.27 



and limits are set on the production cross section times 
branching ratio for the HV phenomenology we have used 
as a benchmark. The results shown for this phenomenol- 
ogy can be used to constrain other models by considering 
the differences of the cross section, branching ratio, and 
the kinematics of the final state. 



TABLE IX: Observed and expected limits at 95% C.L. calculated for different signal MC sampli 



Higgs Boson HV Mass HV life- Observed Expected Limit (pb) 
Mass (GeV/c 2 ) (GeV/c 2 ) time (cm) Limit (pb) -2 a -la median +1 a +2 a 

low HV mass search 
130 20 1.0 6.2 4.3 4.3 4.3 6.2 8.4 

170 20 1.0 22.1 15.2 15.2 15.2 22.1 29.9 



high HV mass search 



130 


40 


1.0 


15.9 


10.5 


10.5 


10.5 


15.9 


21.5 


170 


40 


1.0 


4.4 


2.9 


2.9 


2.9 


4.4 


6.0 


170 


65 


1.0 


11.7 


7.7 


7.7 


7.7 


11.7 


15.7 


130 


40 


0.3 


17.8 


11.7 


11.7 


11.7 


17.8 


24.2 


130 


40 


2.5 


40.7 


26.8 


26.8 


26.8 


40.7 


55.1 


130 


40 


5.0 


94.3 


62.0 


62.0 


62.0 


94.3 


127.9 
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FIG. 9: Observed and expected limits at 95% C.L. with +1 
and +2 a bands for signal MC simulation with HV mass 
20 GeV/c 2 . The hashed line is the Higgs boson production 
cross section in the SM. 



FIG. 11: Observed and expected limits at 95% C.L. with +1 
and +2 a bands for signal MC with HV masses 65 GeV/c 2 . 
The hashed line is the Higgs boson production cross section 
in the SM. 
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FIG. 10: Observed and expected limits at 95% C.L. with +1 
and +2 a bands for signal MC with HV masses 40 GeV/c 2 . 
The hashed line is the Higgs boson production cross section 
in the SM. 



FIG. 12: Observed and expected limits at 95% C.L with +1 
and 2 a bands for signal MC for differing HV particle lifetimes. 
The hashed line is the Higgs boson production cross section 
in the SM. 
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